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ABSTRACT 


All  integral  transforms  can  be  viewed  as  projections  onto  collections  of  func¬ 
tions  in  a  Hilbert  space.  The  properties  of  an  integral  transform  are  completely 
determined  by  the  collection  of  functions  onto  which  it  projects.  The  wavelet  trans¬ 
form  projects  onto  a  set  of  functions  which  satisfy  a  simple  linear  relationship  between 
different  levels  of  dilation.  The  properties  of  the  wavelet  transform  are  determined 
by  the  coefficients  of  this  linear  relationship.  This  thesis  examines  the  connections 
between  the  wavelet  transform  properties  and  the  linear  relationship  coefficients. 
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I. 


INTRODUCTION 


A  great  deal  has  been  written  about  wavelet  analysis  in  the  last  several  years. 
Much  of  this  work  however  is  presented  in  the  language  of  signal  processing.  This 
is  to  be  expected  since  wavelet  analysis  has  been  primarily  developed  in  the  signal 
processing  field.  This  paper  presents  the  discrete  wavelet  transform  in  a  projective 
framework.  The  discrete  wavelet  transform,  like  most  integral  transforms,  can  be 
viewed  as  projection  onto  a  new  basis  determined  by  a  given  collection  of  functions.  In 
light  of  this  the  properties  of  the  transform  are  primarily  determined  by  the  properties 
of  the  basis  functions.  The  development  of  a  transform  is  then  a  question  of  finding 
a  basis  which  will  produce  the  properties  we  desire. 

What  properties  do  we  desire  in  the  wavelet  transform?  We  would  like  the 
transform  to  incorporate  the  spectral  decomposition,  or  frequency  localization,  of  the 
Fourier  Transform.  In  addition,  we  would  like  to  have  time-localization  of  the  input 
function.  This  allows  us  to  determine  when  in  time  a  given  component  of  the  input 
function  occurs.  Frequency  localization  in  the  Fourier  Transform  is  accomplished  by 
projecting  onto  sines  and  cosines  which  have  non-compact,  in  fact  infinite,  support. 
This  precludes  the  Fourier  Transform  from  having  time  localization  capability.  Time 
localization  can  be  achieved  by  using  the  short  time  Fourier  Transform  (STFT), 
which  analyzes  the  input  function  in  small  time  windows.  This  time  localization  is 
fixed  in  scale,  depending  on  the  time  window  used.  Another  desired  property  of  the 
wavelet  transform  is  arbitrary  approximation  power.  This  is  useful  if  we  wish  to 
use  the  transform  in  a  data  compression  scheme,  that  is,  setting  to  zero  transform 
coefficients  below  a  certain  threshold  level  and  storing  the  input  function  in  terms 
of  this  reduced  coefficient  set.  Finally,  and  perhaps  most  importantly,  we  desire  the 
basis  onto  which  we  project  to  be  orthonormal.  This  will  allow  easy  inversion  of 
the  transform.  If  the  basis  is  orthonormal,  inversion  is  accomplished  by  weighted 
superposition  of  the  basis  functions  using  the  transform  coefficients  as  weights.  This 
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is  the  case  with  the  Fourier  Transform  which  uses  the  orthonormal  sine  and  cosine 
basis.  This  is  contrasted  with  the  Laplace  Transform  which  uses  an  exponential  basis 
which  is  not  orthogonal  over  the  real  line.  This  is  why  the  Laplace  Transform  must 
be  inverted  by  a  contour  integral  in  the  complex  plane. 

The  question  of  developing  a  transform  with  these  properties  is  reduced  to  that 
of  finding  a  basis  for  L2,  the  space  of  square  integrable  functions  on  the  real  line,  which 
possesses  the  desired  properties,  namely,  time  and  frequency  localization,  arbitrary 
approximation  power,  and  orthogonality.  If  we  restrict  our  attention  to  compactly 
supported  functions,  we  can  achieve  a  measure  of  time  localization,  however  only  at 
a  fixed  level  dependent  on  the  support  of  the  basis  function.  This  severely  limits  the 
frequency  information  available  as  well,  since  we  can  only  clearly  discern  the  frequency 
which  corresponds  to  the  basis  functions.  Compactly  supported  functions  are  easier 
to  compute  with,  hence  their  use  in  finite  element  methods,  and  we  would  like  to 
use  them  here.  We  need  to  find  a  way  to  use  compactly  supported  basis  functions 
without  limiting  the  time  or  frequency  information  available  to  us.  The  solution  is  to 
use  dilations  of  a  compactly  supported  function.  A  dilation  of  /  (a;)  is  /  (ax)  for  some 
a,  normally  a  >  0.  Notice  that  as  we  dilate  a  function  /  the  support  of  /  changes 
and  also  the  frequency  of  /  changes.  This  allows  us  to  cover  much  wider  time  and 
frequency  scales  than  using  basis  functions  of  fixed  compact  support.  To  give  us 
the  maximum  amount  of  flexibility  we  will  use  a  basis  composed  of  translations  and 
dilations  of  a  fixed  function  <f>. 

Consider  the  dilation  equation,  or  two  scale  recursion  relation 


(j>  ( x )  =  (2x  —  k)  for  some  {c^}  .  (1.1) 

k 

The  solution  is  completely  determined  by  the  dilation  equation  coefficients  {c*,}.  Also 
notice  this  dilation  equation  with  coefficients  {co  =  1,  cx  —  1}  is  satisfied  by  the  Haar 
function: 
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h(x)  =  { 


for  x  e  [0, 1] 
for  x  £  [0, 1] . 
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Figure  1.  Dilation  of  Haar  Function 

The  Haar  Transform  which  uses  as  a  basis  functions  derived  from  h(x)  is  in 
fact  an  early  example  of  a  wavelet  transform  and  predates  the  current  interest  in 
wavelet  analysis  by  several  decades. 

In  this  paper  we  show  that  solutions  to  1.1  exist  which  have  arbitrary  approx¬ 
imation  power  and  are  orthogonal  to  their  integer  translates.  We  further  show  how 
these  solutions  can  be  used  to  construct  a  basis  for  L2,  and  how  the  discrete  transform 
which  represents  projection  onto  this  basis  can  be  computed  in  order  N  operations. 
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II.  EXISTENCE,  UNIQUENESS, 
APPROXIMATION,  AND  ORTHOGONALITY 


A.  EXISTENCE  AND  UNIQUENESS 

Before  we  investigate  the  relationship  between  the  coefficients  of  the  dilation 
equation  and  the  properties  of  the  solutions,  we  need  to  determine  under  what  con¬ 
ditions  a  solution  in  fact  exists.  To  begin  our  search  for  solutions  to  the  dilation 
equation  let  us  consider  the  iteration 

<t>n+ 1  =  ^2  °k^n  (2X  ~~  *0  '  (H.l) 

k 

Solutions  to  the  dilation  equation  are  fixed  points  of  the  dynamical  system  defined 
by  this  iteration.  The  following  theorem,  which  we  present  without  proof,  gives 
conditions  under  which  square  summable  solutions  to  the  dilation  equation  exist  and 
is  found  in  [Ref.  1]. 

THEOREM  II.l.  If  the  {c*}  satisfy  the  following  conditions 

1.  {cjt}  is  a  finite  set, 

2.  Ylk  Cfc  = 

3.  |mo  (z)  |2  +  | m0  (z  +  tt)|2  =  1  for  all  z  e  R,  where  m0  (z)  =  ck^~lkz 

then  the  dynamical  system  in  II.  1  has  a  nontrivial  square  summable  fixed  point 
with  compact  support. 

In  the  same  paper,  a  similar  result  is  presented  for  distributional  solutions  to 
1.1  which  does  not  require  the  third  condition.  In  practice  we  are  looking  for  a  basis 
for  the  function  space  L 2,  so  we  restrict  our  attention  to  square  summable  solutions. 

Uniqueness  of  this  solution,  up  to  normalization,  is  guaranteed  by  the  same 
conditions  which  give  us  existence.  In  the  square  summable  case  the  normalization  is 
f  <f)dx  =  1.  Observe  that  this  normalization  requirement  is  consistent  with  condition 
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2  above  since 


/  ^  (*)<**  =  J  c*^  (2x  ~  *0 

=  \^ckJHy)dy  where  y  =  2x  -  k 
implies  =  1  or  ]Pcfe  =  2. 

fc  /t 

Before  proceeding  with  our  discussion  of  approximation  let’s  look  at  some 
solutions  to  1.1  which  are  familiar  to  us.  In  the  introduction  we  saw  for  coefficients 
{1,1}  the  solution  to  1.1  was  the  Haar  function.  Other  familiar  functions  which  are 
solutions  to  the  dilation  equation,  though  we  may  not  recognize  them  as  such,  are  the 
Dirac  delta  function  for  {2}  and  the  cardinal  B-splines  for  {  (”)  }  normalized  to  sum 
to  two.  The  Figure  2  shows  the  dilation  of  the  “Hat”  function  which  is  the  solution 
for  the  dilation  equation  for  {|,  1,  the  binomial  coefficients  for  n  =  2,  normalized 
to  sum  to  two. 

The  Cardinal  B-splines  illustrate  why  the  third  condition  in  Theorem  II.  1  is  a 
sufficient  but  not  necessary  condition  for  square  summability.  The  Cardinal  B-splines 
are  clearly  square  summable,  but  it  is  easily  seen  that  { (”) }  do  not  satisfy  the  third 
condition  of  Theorem  II.  1. 


B.  SOME  TOOLS  FOR  DILATION  EQUATIONS 

Before  proceeding  with  our  investigation  of  the  relationship  between  the  di¬ 
lation  equation  coefficients  {c*,}  and  the  corresponding  solution  <f>(x),  we  should  in¬ 
troduce  a  very  useful  tool  in  the  study  of  dilation  equations.  Dilation  equations  are 
difficult  to  study  because  only  in  special  cases  do  we  actually  know  what  the  solution 
function  is.  We  are  usually  limited  to  knowing  only  the  recurrence  relation  between 
different  scalings  of  the  function.  Faced  with  this  lack  of  information  we  must  turn 
to  the  Fourier  Transform  to  glean  any  information  we  can  about  the  function.  Let  us 
see  what  the  recursive  nature  of  1.1  can  tell  us  about  the  solution  <f>(x). 
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Now,  notice  the  integral  in  II. 4  is  *(§)  ,so  we  have 

Applying  this  process  repeatedly  to  <3>(£/2),  4>(£/4),  ...<I>(£/2”)  we  get 

=  ft  1 

j~  1  k 

Let  us  now  take  the  limit  as  n  — >•  oo, 

oo  1 

*«)= n?Ec‘e^i4,(°)-  (il5> 

j= 1  Z  k 

If  we  require  by  way  of  normalization  that  J  (j>dx  =  1  then  we  have  4>(0)  =  J  <f>dx  =  1 
and  II.5  becomes 

00  1  _ 

$(0  =  IliXlcfce^- 

j= 1  k 

Notice  we  could  write  this  infinite  product  as 

i=l  k  x  ' 

Now  we  see  the  importance  of  the  m0(z)  expression  in  the  third  condition  of  Theorem 
II. 1,  the  behavior  of  this  trigonometric  polynomial  determines  the  convergence  of  the 
infinite  product  representation  of  the  Fourier  Transform,  and  therefore  the  existence 
of  a  square  summable  solution  to  1.1.  A  fairly  simple,  yet  powerful  result  of  this 
infinite  product  representation  of  the  Fourier  Transform  is  the  Convolution  Theorem 
for  Dilation  Equations. 

THEOREM  II. 2.  Let  <j>  and  <p  be  solutions  to  the  dilation  equation  for  coefficients 
{cfc}  and  {d*}  respectively.  The  function  given  by  the  continuous  convolution  of 
<f>  with  ip  is  the  solution  to  the  dilation  equation  with  coefficients  {bj},  where  {bj}  is 
one  half  the  discrete  convolution  of  {c^}  with  {di}. 

Proof.  As  we  have  seen  the  Fourier  Transforms  of  <f>  and  <p  are  given  by 

OO  -j  OO  1 

$  (o = n  2  5Z  Cke~^  and  ^  (o = n  2  die 

71=0  k  71= 0  l 
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respectively.  The  product  of  these  transforms  4>\k  is  given  by 

OO  -j  OO  1 

^ = n  2 

n=0  k  m~  0  l 

We  can  combine  the  two  product  operators  by  associating  the  factors  where  n  =  m 
to  get 

00  i  i 

^ = n  9  ^2  Cke~^  o  yi  die^- 

n=0  “  fc  / 

•  — 

Observe  each  factor  in  this  infinite  product  is  a  product  of  two  polynomials  in  e  271  , 
namely,  |  ^2k  and  |  die Let  p  (£)  be  the  product  of  these  polynomials. 

Prom  polynomial  multiplication,  we  see  the  coefficients  of  p(£)  axe  given  by 

1  A 

b\  —  —  ^  ^  Cjd\—j , 

2i=o 

that  is,  the  set  {6^}  is  one  half  the  discrete  convolution  of  {c*}  with  {d{\,  and  the 
product  polynomial  p  is 

P(0  =  l^Zbxe^- 

X 

Substituting  this  into  the  expression  for  gives  us 

00  1 

n=0  Z  A 

Observe  the  right  hand  side  of  this  equation  is  the  infinite  product  representation 
of  the  Fourier  Transform  of  the  solution  to  the  dilation  equation  with  coefficients 
{6a}-  Thus,  ‘L'k  is  the  Fourier  Transform  of  the  solution  to  the  dilation  equation  for 
coefficients  {6j}.  Now,  from  the  convolution  theorem  for  Fourier  Transforms  we  know 
is  the  transform  of  4>  *  <p,  the  continuous  convolution  of  <f>  with  <p.  Since  Fourier 
Transforms  are  unique  this  completes  the  proof.  □ 

Armed  with  this  result  we  see  why  the  Cardinal  B-splines  are  solutions  to 
the  dilation  equation  for  the  binomial  coefficients  normalized  to  sum  to  two.  If  we 
develop  B-splines  from  a  repeated  convolution  standpoint  as  is  done  in  [Ref.  4],  the 
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relationship  between  B-splines  and  binomial  coefficients  becomes  transparent  in  light 
of  Theorem  II. 2.  To  generate  the  B-splines  we  start  with  the  Haar  function  which  we 
can  consider  the  “zeroth”  order  B-spline  (piecewise  constant  approximation),  we  get 
the  next  higher  order  B-spline  (piecewise  linear  approximation)  by  convolution  with 
the  Haar  function.  By  our  convolution  theorem  this  says  we  should  take  one  half  the 
discrete  convolution  of  {1, 1}  with  itself.  Convolving  {1, 1}  with  itself  we  get  {1,2, 1} 
the  binomial  coefficients  for  n  =  2,  which  we  must  normalize  to  sum  to  two.  Similarly, 
to  generate  the  (n  +  l)st  degree  B-spline  we  convolve  the  nth  degree  B-spline  with  the 
Haar  function,  so  we  convolve  the  coefficients  of  the  nth  degree  B-spline  with  {1, 1}. 
This  gives  us  the  binomial  coefficients  for  n+ 1,  which  again  we  must  normalize  to  sum 
to  two.  The  appearance  of  the  B-splines  is  of  more  than  passing  interest.  B-splines 
and  their  translates  are  often  used  as  a  basis  for  approximation.  This  is  very  similar 
to  the  scheme  we  are  investigating.  We  are  pursuing  the  goal  of  representing  L2  in 
its  entirety  rather  than  approximating  functions  in  a  subspace  of  L2.  Another  aspect 
of  spline  approximation  which  will  be  of  great  importance  in  the  next  section  is  the 
use  of  sliding  window  filters  as  a  tool  to  smooth  data.  Sliding  window  filters  can  be 
viewed  as  convolution  with  the  Haar  function.  Thus  repeated  application  of  sliding 
window  filters  is  equivalent  to  repeated  convolution  with  the  Haar  function,  both  of 
which  lead  to  smoother  data,  or  smoother  approximation  if  one  is  constructing  a  basis 
function,  as  we  shall  see  in  the  next  section. 

C.  APPROXIMATION 

Since  we  will  be  using  solutions  to  1.1  as  a  basis  for  approximation  we  would  like 
to  know  how  well  an  arbitrary  function  can  be  approximated  by  cj)  and  its  translates. 
Our  measure  of  approximation  power  will  be  polynomial  precision,  or  what  degree 
arbitrary  polynomial  can  be  exactly  represented  by  <j)  and  its  translates.  A  key  step  in 
finite  element  methods  is  to  approximate  a  function  by  the  translates  of  a  compactly 
supported  basis  function.  To  determine  the  approximation  power  of  solutions  to  the 
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dilation  equation  we  consider  the  work  done  on  the  finite  element  method  by  Strang 
and  Fix.  In  [Ref.  3]  they  show  that  the  following  statements  are  equivalent: 

1.  Any  polynomial  of  degree  less  than  or  equal  to  p—  1  can  be  exactly  represented 
by  a  linear  combination  of  <p  and  its  translates. 

2.  <F(0)  ^  0  and  ^-$(0  |?=2, r„=  0  for  n  6  Z,  n  ^  0  and  a  =  0, 1, 

The  first  of  these  statements  is  the  definition  of  approximation  power  that 
we  are  interested  in,  the  second  is  related  to  the  Fourier  Transform  of  the  function 
</>.  We  will  use  the  second  statement  to  construct  a  condition  on  {c*,}  that  will  give 
the  desired  approximation  power.  First  notice  that  T(0)  ^  0  is  taken  care  of  by  our 
normalization  requirement  since 


To  satisfy  the  rest  of  this  statement  we  must  look  at  the  infinite  product  representation 

of  ■NO 

n=l  A  k 

Since  this  product  converges  and  $(0)^0  we  see  that  the  product  vanishes  for  some 
value  of  £  if  and  only  if  at  least  one  of  its  factors  vanish  for  that  value  of  £.  Observe 
that  the  following  condition  is  sufficient  to  ensure  the  desired  behavior  of  (£): 

•  2  =  7r  is  a  root  of  order  p  of  m0  (z). 

In  the  previous  section  we  saw 

*  (0  =  fl  (Jr)  •  (n.6) 

Now,  for  all  l  €  Z,  l  ^  0,  there  exists  n  such  that  mod  27t  =  7r.  Therefore,  one 
of  the  factors  in  II.6  is  zero,  and  $  (27 vl)  =  0.  Now  consider  ^  [$  (£)], 
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Since  m o  ( z )  has  a  root  of  order  p  at  7 r,  the  right  hand  side  of  II.7  is  zero.  Similarly 
we  can  see  ^  [4>  (£)]  =  0  for  j  =  0, 1,  ..p  provided  m0  (z)  has  a  zero  of  order  p  at  7r. 

Unfortunately,  this  elegant  sufficient  condition  on  mo  (z)  is  not  necessary. 
Observe,  if  ^  [4>  (£)]  =  0,  ^  [m0  (7r)]  need  not  be  zero,  since  m0  (§) ,  which  is  a  factor 
°f  riyt^n  ^0  (Jr)  could  be  zero,  in  which  case,  II. 7  will  be  satisfied  regardless  of  the 
value  of  ^  [m0  (7r)]. 

The  sufficient  condition  for  the  approximation  power  of  </>  can  be  arrived  at  in 
several  ways.  In  [Ref.  2]  it  is  presented  as  a  sum  condition,  that  is: 

N 

^(-1)^  =  0  far  k  =  0,1,  ...,p-l.  (II. 8) 

/=o 

We  can  see  this  is  equivalent  to  our  condition  that  7r  be  a  root,  of  order  p,  of  m0  (z), 
by  first  expressing  the  sum  condition  as  a  matrix  equation, 


Kc  =  0 


where 


Kij  =  (-1  )jf. 

K  is  a  p  x  N  +  1  matrix  with  the  following  structure: 


1 

-1 

1 

-1 

1 

0 

-1 

2 

-3 

4  ... 

0 

-1 

4 

-9 

16  ••• 

0 

-1 

8 

-27 

64  ••• 

0 

-1 

16 

-81 

256  ••• 

p. 

• 

Calculating  the  null  space  of  this  matrix  yields  a  basis  for  the  null  space  which  is 
given  by  (  [i^] }  where  is  the  N  +  1  long  vector  whose  non-zero  entries  begin 
at  the  kth  position  and  are  the  binomial  coefficients  (p  for  Ar  =  1,2, ...,  N  -  p.  For 
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example,  a  basis  for  the  null  space  of 

1-11-1  1 
0-12-3  4 
0-14-9  16 


f 

1 

I 

O 

CO 

1 

3 

5 

3 

1 

3 

1 

O 

1 

j 

Since  this  is  a  basis  for  the  null  space  of  K,  any  {c*,}  which  satisfies  the  sum  condi¬ 
tion  II.8  must  be  a  linear  combination  of  the  basis  vectors.  Since  the  basis  vectors 
are  shifted  copies  of  one  vector,  the  matrix  whose  columns  are  the  basis  vectors  is 
Toeplitz.  So,  any  linear  combination  of  the  basis  vectors  can  be  represented  as  the 
product  of  this  Toeplitz  matrix  T  with  a  vector  whose  elements  are  the  weights  in  the 
linear  combination.  Alternatively,  we  can  consider  this  multiplication  by  a  Toeplitz 
matrix  to  be  discrete  convolution  of  the  vector  of  weights  with  the  vector  of  binomial 
coefficients,  i.e.  polynomial  multiplication  if  we  consider  the  elements  of  these  vectors 
to  be  coefficients  of  polynomials. 

So,  what  does  all  this  do  for  us?  Any  {c*}  which  satisfies  II.8  can  be  considered 
to  be  the  coefficients  of  a  polynomial  /  (a;)  which  in  turn  can  be  expressed  as  the 
product  of  two  polynomials  g  (x)  and  h  (x).  The  coefficients  of  g  (x)  are  given  by  the 
binomial  coefficients  (?)  where  p  is  the  required  approximation  power.  Recall  that 
the  polynomial  with  coefficients  (?)  is  g  (x)  =  (x  +  l)p.  That  is,  the  polynomial  with 
coefficients  {cfc}  has  a  root  of  order  p  at  x  =  —1.  Now,  if  we  consider  the  relationship 
between  the  trigonometric  polynomial  m0  (z)  =  ckeikz  and  the  polynomial  /  (x)  = 
J2k  ck%k,  we  see  f  (eiz)  is  equivalent  to  m0  (z).  Further,  if  /  (x)  has  a  root  at  x  =  —  1, 
then  m0  (z)  has  a  root  at  7 r. 
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Yet  another  way  to  see  the  sufficiency  of  our  condition  for  approximation 
power  is  to  observe  that  each  higher  order  of  the  root  of  m0  (z)  at  7T  is  equivalent 
to  an  additional  factor  of  (x+  1)  in  the  polynomial  whose  coefficients  are  { }  In 
turn,  each  of  these  factors  is  equivalent  to  the  discrete  convolution  of  a  smaller  set 
of  coefficients  with  {1, 1}.  Now,  by  our  convolution  theorem  for  dilation  equations 
each  of  these  discrete  convolutions  is  equivalent  to  the  continuous  convolution  of  a 
function  which  precedes  (f>  with  the  function  whose  dilation  equation  coefficients  are 
{1, 1}.  Of  course  the  function  with  dilation  equation  coefficients  {1,1}  is  the  Haar 

—  i£ 

function.  Recall  that  the  Fourier  Transform  of  the  Haar  function  is  - — which 
decays  like  |.  Each  factor  of  (x  +  1)  in  the  polynomial  whose  coefficients  are  {c*,} 
introduces  a  factor  which  decays  like  j  in  the  Fourier  Transform  of  (j).  Each  factor 
of  |  in  the  Fourier  Transform  implies  one  degree  of  smoothness,  or  approximation 
power  in  the  function  <f>. 

D.  ORTHOGONALITY 

As  stated  in  the  introduction,  an  orthogonal  basis  of  functions  is  advanta¬ 
geous.  Orthogonality  allows  us  to  reconstruct  the  input  function  from  its  transform 
coefficients  by  summing  the  basis  functions  weighted  by  the  transform  coefficients. 
This  is  the  case  in  the  Fourier  Transform.  By  contrast  the  Laplace  Transform  uses 
the  exponential  functions  {epx}  as  a  basis.  These  functions  are  not  orthogonal  over 
the  real  line  and  as  a  result  the  inverse  Laplace  Transform  involves  a  contour  integral 
in  the  complex  plane.  We  wish  to  take  advantage  of  this  ease  of  inversion,  so  we  will 
restrict  our  attention  to  orthogonal  basis  functions.  Again,  we  are  limited  by  not 
having  an  explicit  representation  of  our  basis  functions.  We  must  show  that  there 
exist  solutions  to  1.1  which  are  orthogonal  to  their  integer  translates.  Additionally, 
we  still  require  these  functions  to  have  the  desired  approximation  power. 

Fortunately,  such  solutions  do  exist.  In  fact,  we  have  already  seen  one  of 
them.  The  Haar  function  is  the  solution  to  the  dilation  equation  with  coefficients 
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{1, 1}.  Since  the  Haar  function  has  support  width  of  one  unit,  any  integer  translate 
of  the  Haar  function  has  support  which  does  not  intersect  the  support  of  the  original 
Haar  function  therefore  the  Haar  function  must  be  orthogonal  to  any  of  its  integer 
translates.  However,  the  Haar  function  has  poor  approximation  power,  and  is  not 
an  ideal  candidate  for  use  as  a  basis  for  an  integral  transform.  The  other  members 
of  the  Cardinal  B-spline  family  are  used  for  piecewise  approximation  by  higher  order 
polynomials  and  therefore  have  increasing  approximation  power  and  might  be  good 
candidates  for  transform  basis  functions.  Unfortunately,  the  higher  order  Cardinal 
B-splines  are  not  orthogonal.  This  is  easily  seen  since  each  of  the  B-splines  is  positive 
valued  over  their  support  and  the  product  of  two  B-splines  with  intersecting  support 
would  be  positive,  thus  the  inner  product  of  these  B-splines  would  be  non-zero.  In  this 
section  we  develop  both  necessary  and  sufficient  conditions  for  orthogonal  solutions 
to  the  dilation  equation. 

Let  us  first  look  at  the  consequences  of  this  orthogonality,  and  develop  a 
necessary  condition.  Suppose  (j>  (x)  is  orthogonal  to  its  integer  translates  <fi(x-  k) 
for  all  k  E  Z,  A;  ^  0.  Orthogonality  here  is  with  respect  to  the  inner  product  on  L2, 
that  is 

(/  0*0  ,  g  0*0)  =  [  f  0*0  9  {x)dx. 

So  far  we  have  only  considered  real  dilation  equation  coefficients.  We  will  continue 
to  restrict  our  attention  to  real  coefficients.  However,  the  following  development  is 
presented  allowing  the  possibility  of  complex  coefficients  to  be  consistent  with  the 
above  inner  product. 

Suppose  (j)  (.'/;)  is  orthogonal  to  <j)[x  —  k)  for  k  6  Z,  k  7^  0;  additionally  suppose 
(, t >  (2a;)  is  orthogonal  to  ( 2x  -  j)  for  j  e  Z,  j  ^  0,  then 

/OO  _ 

<t>  (a;)  (j)  (x  —  k)dx. 

OO 

Substituting  in  1.1  for  (f>  (a;)  and  (f>(x  -  k)  we  get 

/OO  poo 

<f)  (a;)  (j)  (x  —  k)dx  —  /  ^  ci<j)  (2a;  -  l)  y ~^c^>  (2  (x  -  k)  -  m)dx. 

00  J —OO  j  m 

i  m 
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Interchanging  the  order  of  integration  and  finite  summation 


/oo  _  _  _  poo  _ 

4>  (x)  (f>  (x  —  k)dx  =££  QCm  /  <f>(2x-l)</>  (2 

-OO  i  ™  J —OO 


x  —  2k  —  m)dx.  (II. 9) 


Since  (j)  (2x)  is  orthogonal  to  (j)  (2x  -  j)  for  j  €  Z,  j  ^  0  the  integral  on  the  right  hand 
side  of  II. 9  must  be  zero  unless  l  =  2k  +  m.  This  simplifies  II. 9  to 

/OO  _  _  poo  _ 

(j)  (x)  4>  (x  —  k)dx  =  c/Q_ 2fc  /  4>  (2x  —  l)<f>  (2x  -  l)dx. 

-OO  l  J —oo 

So,  if  (j)  (x)  is  to  be  orthogonal  to  <f>  (x  —  k)  we  require 

/OO  _ 

(2x  -l)<t>  (2x  -  l)dx  =  0. 

i  -°° 

And  since  we  assume  <f>  is  not  identically  zero,  the  above  integral  cannot  be  zero,  thus 


y;  C/Q_2fc  =  0  for  k  6  Z,  k  ^  0. 


(11.10) 


Additionally,  when  k  =  0 


/OO  _  _  poo  _ 

<j>  (x)  (j)  (x)dx  =  c{c~i  /  (j)  (2x  —  l)(j)  (2x  —  l)da 

-OO  /  J —  oo 


Let  y  =  2x  —  l  in  the  right  hand  integral  and  we  get 


/oo  _  1 

<f>  (x)  (j)  (x)dx  =2^2 


cici  /  (j){y)(j){y)dy. 


For  this  equation  to  be  true  we  must  have 


YlClCi  =  2- 

l 

We  can  combine  these  to  get  a  necessary  condition  for  orthogonality 

^  ^  qq_2/c  =  2 5k  for  all  k  G  Z 


am) 


where  5^  is  Kronecker’s  delta  function:  5k  =  1  for  k  —  0,  8k  =  0  for  k  ^  0. 

It  appears  that  II.  10  is  yet  another  condition  we  are  imposing  on  the  dilation 
equation  coefficients.  However,  with  a  little  work  we  will  see  that  this  necessary 


condition  for  orthogonality  in  II.  10  is  equivalent  to  the  third  sufficient  condition  for 
square  summability  in  Theorem  II.  1,  expressed  in  terms  of  the  coefficients  rather 
than  in  terms  of  the  polynomial  m0  (z).  Let  us  start  with  the  third  condition  from 
Theorem  II.  1, 

|m0(z)|2  +  |m0(z  +  7r)|2  =  1  for  all  z  e  R  (11.12) 

where  m0(z)  =  ~\ckelkz.  (11.13) 

k 

First,  observe 

mo(z  +  *)  =  1 C^*W)  =  \  E (H.14) 

k  k 

Now,  if  k  is  even,  etklT  =  1,  and  if  k  is  odd,  elklT  =  —  1  so 

m0{z  +  7r)  =  \  (~l)fc  cketkz. 
k 

Recall  |C|2  =  CC  which  leads  us  to 

|rao(.z)|2  +  | m0(z  +  7r)|2  =  m0(z)m0  {z)  +m0(z  +  ir)  m0  ( z  +  7 r). 

Substituting  in  the  expression  for  mo  (z)  from  11.13  and  the  expression  for  mo  (z  +  7 r) 
from  11.14  we  now  get 

\m0{z)\2  +  |m0(2:  +  7r)|2  = 

5  E  ^ik‘l'Ec‘e“‘ + 5  E  (-1)*  c^k*\  E  (-1)'  c‘e<h ■  (1U5> 

k  l  k  l 

Carrying  the  conjugation  down  into  the  trigonometric  polynomials  and  simplifying 
we  get 

|m0(z)|2  +  |m0  {z  +  7r)  |2  = 

'E7‘e~“’  +  jE  (-1)1  c*e“*  E  (11.16) 

k  l  k  l 
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This  sum  is  a  Laurent  polynomial  in  elz.  Keeping  in  mind  that  this  sum  must  equal 
one  for  all  z  €  R,  we  see  that  the  coefficients  of  (elz)n  for  n  7^  0  must  be  zero  since  the 
sum  is  real-valued.  We  need  an  expression  for  the  coefficient  of  (elz)n;  with  a  little 
work  we  can  see  this  expression  is 

bn  =  \  X)  +  \  (-!)”  c 

3  3 

Now  notice,  if  n  is  odd  then  bn  is  zero  automatically.  If  n  is  even  then  we  can  say 
n  =  2m  for  some  m  E  Z,  and 

bn  =  2  ^  y  cjcj-2m  —  0 
3 

thus 

CjCj_2rn  =  0  for  m  €  Z  ,  m  /  0 

3 

when  m  =  0,  bn  =  1  and 

J2c^  =  2- 

3 

Combining  these  conditions  we  get 

CjCj—2m  =  2 Sm  for  all  m  €  Z 

3 

as  in  condition  II. 10.  This  chain  of  arguments  can  be  reversed  to  derive  11.12  from 
II. 10.  So,  we  have  shown  the  equivalence  of  the  two  conditions.  If  we  look  for 
orthogonal  solutions  to  the  dilation  equation,  the  coefficients  necessarily  satisfy  the 
sufficient  condition  for  square  summability. 

This  is  all  well  and  good,  but  we  still  need  to  find  a  sufficient  condition  on 
the  dilation  equation  coefficients  to  ensure  orthogonality.  There  is  a  necessary  and 
sufficient  condition  for  orthogonality  of  the  solutions  of  the  dilation  equation.  This 
condition  can  be  found  in  [Ref.  5],  where  it  is  expressed  in  terms  of  the  location  of 
the  roots  of  the  trigonometric  polynomial  mo  ( z )  as  defined  in  Theorem  II.l.  This 
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condition,  while  it  has  the  advantage  of  being  both  necessary  and  sufficient,  is  a  little 
cumbersome.  We  will  instead  present  a  slightly  stronger  condition  on  the  placement 
of  roots  of  mo  (. z )  which  is  also  sufficient  for  orthogonality.  This  condition  is  found  in 
[Ref.  5]  as  well  and  relies  on  the  necessary  and  sufficient  condition  mentioned  above. 
We  present  it  here  and  refer  the  reader  to  [Ref.  5]  for  proof. 

THEOREM  II. 3.  Let  (f)  be  the  solution  to  the  dilation  equation  for  coefficients  {e*}. 
Suppose  m0  ( z )  =  \  Yjk  °kelkz  satisfies  |m0  (z)  |2  +  |m0  (z  +  7r)  |2  =  1  for  all  z  eK  ,  and 
m0  (0)  =  1.  If  mo  (z)  has  no  zeros  in  [—§,§],  then  (f>  ( x )  is  orthogonal  to  fi{x  —  k ) 
for  all  k  €  Z,  k  ^  0. 

Notice  since  we  required  ^2kck  =  2,  we  automatically  satisfy  m0  (0)  =  1. 
Also,  the  sufficient  condition  for  approximation  power  was  that  mo  (z)  have  roots  of  a 
given  multiplicity  at  ir,  it  is  encouraging  that  this  does  not  conflict  with  the  sufficient 
condition  for  orthogonality. 

Well  known  examples  of  compactly  supported,  orthogonal  solutions  to  the 
dilation  equation  with  higher  approximation  power  than  the  Haar  function  are  the 
scaling  functions  associated  with  the  Daubechies’  Wavelets  [Ref.  2],  We  will  describe 
what  a  wavelet  is  in  the  next  chapter. 
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III. 


THE  DISCRETE  ORTHOGONAL 
WAVELET  TRANSFORM 


A.  DECOMPOSITION  OF  THE  FUNCTION  SPACE  L2 

In  the  previous  chapter  we  saw  that  we  could  find  solutions  to  the  dilation 
equation.  We  also  saw  that  these  functions  could  be  made  to  have  a  specified  ap¬ 
proximation  power  and  be  orthogonal  to  their  integer  translates.  In  order  to  develop 
a  transform  based  on  these  functions  we  must  show  that  a  basis  for  L2,  the  space  of 
square  integrable  functions  defined  on  the  real  numbers,  can  be  formed  from  solutions 
to  the  dilation  equation.  As  we  have  previously  seen,  the  solutions  to  the  dilation 
equation  are  square  integrable  provided  the  dilation  equation  coefficients  meet  some 
basic  conditions.  In  this  section  we  will  show  that  we  can  derive  from  the  solution 
to  the  dilation  equation  a  basis  for  L2,  and  that  we  need  not  impose  any  additional 
conditions  on  the  dilation  equation  coefficients  to  accomplish  this. 

Let  us  consider  the  space,  call  it  Vo,  of  functions  which  can  be  expressed  as 
linear  combinations  of  the  solution  <f>  to  the  dilation  equation  and  its  integer  translates. 
Since  <j>  is  square  integrable,  V0  is  a  subspace  of  L2.  Let  us  also  consider  the  space, 
call  it  VI  i,  spanned  by  (f>  (2x  —  k )  where  k  €  Z  ,  this  space  is  composed  of  functions 
which  are  linear  combinations  of  \/2 (j)  (2x)  and  its  half  integer  translates.  We  have 
scaled  the  function  by  \/2  so  that  the  norm,  in  L2,  of  the  prospective  basis  function 
will  be  one,  this  has  no  impact  on  the  space  spanned  by  the  functions.  Again,  since  (f> 
is  square  integrable  (j)  (2x)  is  also  square  integrable  and  V-\  is  a  subspace  of  L2.  If  we 
continue  in  this  manner,  taking  spaces  spanned  by  (f>jk  =  2 (2~^x  —  k),  we  get  an 
infinite  sequence  of  subspaces  of  L2,  namely  V0,V-i,  V_2, ....  If  we  also  allow  positive 
indices  for  these  spaces,  which  implies  negative  powers  of  2  in  2~M2<j)  (2~^x  —  k )  we 
get  a  bi-infinite  sequence  of  subspaces  of  L2 

Vj  —  linear  span  {</>  (2~J£  —  k)  \  k  €  Z}  for  j  €  Z. 

This  sequence  of  subspaces  has  the  following  properties. 
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1.  The  Vj  form  a  nested  sequence,  that  is  Vj  C  Vj_ j. 

Suppose  f  e  Vj  then  /  is  a  linear  combination  of  <f>{ 2“-?x  —  k).  Since  <f>  is 
a  solution  to  the  dilation  equation,  <f>  (2-Jx  —  k)  is  a  linear  combination  of 
(j)  (2-U-Vx  -  k),  thus  Vj  C  Vj- 1. 

2.  /  (x)  €  Vj  if  and  only  if  f  (2x)  €  Vj- 1. 

Suppose  /(x)  =  ak(j)  (2~ix  —  k),  then  /  (2x)  =  X]  ak4>  (2_J  (2x)  —  k)  = 
Oik4>  (2~^~i)x  —  £)  thus  /  (2x)  6  Vj-i.  Conversely,  suppose  /(2y)  6  Vj- \ 
then,  f  (2y)  =  X^afc^  (2_(j_1)y  -  fc)  =  ]To:fc</>  (2_:?’(2y)  -  A:).  Now  let  x  =  2y, 
and  we  get  /  (x)  =  X] akf>  (2~Jx  —  &),  thus  /  (x)  E  Vj. 

Djez^i  =  {0}- 

It  is  sufficient  to  show  that  for  all  /  E  the  space  of  infinitely  differentiable, 
compactly  supported  functions,  ( Pjf,f )  -»  0  as  j  — >  oo.  Here  Pjf  is  the 
orthogonal  projection  of  /  onto  Vj.  As  a  consequence  of  II.  11  the  </>j*  form  an 
orthogonal  set.  Thus,  {<f>jk}  constitutes  an  orthonormal  basis  for  Vj,  and  the 
orthogonal  projector  P_j  can  be  expressed  as 

pjf  =  53  /) 


W,  />  =  ( E  /)  / )  =  53  Kfc*.  />i 


and  if  we  assume  /  has  support  [—a,  a 


l(0ifc»  /)|2  =  f  f  (x)  2  j/2<f>  (2~ix  -  k)(h 
J  ~a 


f  (x)  (j)  (2  3x  —  k)dx\  . 


As  a  consequence  of  Holder’s  inequality 


f  /  (x)  (j>  (2  ix  —  k)dx  <  ||/||  (  f  <j>  (2  Jx  —  k)  <p  (2~-?x  —  k)dx 
J — a  \J —a 


f  f  (x)  (j)  (2-J'x  -  k)dx  <2  3  ||/f  f  \4>{2jx-k)\ 

J  — cl  J  — a 

<  ll/ll2  [  \<t>{y)\2dy. 

J  ll/d-Zul  “C2  3  CL 


'\y+k\<2~ia 
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Observe  that  as  j  — >  oo  the  integral  on  the  right  hand  side  goes  to  zero 
independent  of  k.  Also,  each  \((f>jk,  f)f  is  non-negative.  Combining  these  two 
facts  we  see 


o  <{p}f,!)<  ii/f  y_  f 

thus,  Djez  Yi  =  {°}- 


/ 

i.  J \y+k\<2~3  a 


\(f>(y)\2dy  ->■  0  as  j  -»•  oo 


These  properties,  while  very  helpful,  leave  us  somewhat  short  of  our  goal  of 
producing  a  basis  for  L2.  The  most  obvious  shortcoming  is  that  we  have  not  shown 
that  any  collection  of  these  functions  span  L2.  Before  we  address  the  issue  of  spanning 
L2,  let  us  consider  a  property  of  <3>,  the  Fourier  Transform  of  <j>.  As  we  shall  see,  the 
following  limit  is  of  interest  to  us 


/OO 

I  $  (2~3'oj)  1 2  I g  (uj)  I2  du 

-00 


J-+00 

for  arbitrary  g  &  Cq°.  In  [Ref.  1]  we  see  that  subject  to  the  conditions  already 
imposed  on  {c*},  $  is  an  entire  function.  Additionally,  we  have  seen  $  (0)  =  1,  and 
<  1  for  z  €  R.  We  can  now  use  Lebesgue’s  bounded  convergence  theorem 
[Ref.  6]  to  show 

/oo  r  oo 

\<f>(2-ju>)\2\g(u)\2du}=  /  \g{oo)\2duj.  (III.l) 

-oo  J — oo 

Now,  let  us  get  back  to  trying  to  span  L2  with  some  collection  of  <f>jk-  What 


we  wish  to  show  is  Ujez^i  =  L2-  Since  the  Vj  are  nested,  it  will  be  sufficient  to  show 
that  ( P-jf ,  /)  — V  (/,  /)  as  j  — *  oo  for  all  /ef2  such  that  /,  the  Fourier  Transform 
of  /,  is  in  C™.  Let  f  E  L2  such  that  /  =  g  G  C“.  As  we  have  seen  before  the 
orthogonal  projector  P_j  can  be  expressed  as 


P-jf  —  'y  (/)  ty-jk)  fi-jk- 


Jtez 


Again  we  see 


(p-if,f)=Y,  i  (/>-*>  i2. 
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Recalling  that  the  Fourier  Transform  we  are  using  has  the  normalization  constant 
in  the  inverse  transform,  inner  products  in  the  function  space  are  related  to  inner 
products  in  the  transform  space  by  (</>,  ip)  =  2tt  ($,  T).  By  standard  operational  rules 
for  Fourier  Transforms  we  can  see  the  Fourier  Transform  of  as  defined 

above  is  given  by 


<$>-jk  =  e-2^2-t/2$ 


All  of  this  now  leads  us  to 

/OO  _____ _ 

9  (w)  2 -j/*e2~ikiu$  (: 2-iu)du . 

■oo 

Simplifying  where  possible  and  making  the  change  of  variable  2  =  2~jco  we  get 

/OO 

g  (2 jz)  $  (2)  e~kizdz. 

•OO 

Recall  that  g  has  compact  support,  so  for  sufficiently  large  j,  g  (2 jz)  is  nonzero  only 
in  [ — 7r ,  7rJ  and  we  can  simplify  the  integral  to 

</,  4>-jk)  =  2tt2-^2  [  g  (2 jz)  $  (2)  e-fci2d2.  (III.2) 

Now,  the  right  hand  side  of  III.2  is  just  the  kth  Fourier  coefficient  of  the  function 
2~j/227rg  (2jz)  $  (2).  The  quantity  we  are  interested  in,  namely  ( P-jf,f ),  is  given 
by  the  sum  over  k  of  the  squares  of  the  magnitudes  of  the  individual  inner  products 
given  in  III. 2.  Since  each  of  these  individual  inner  products  is  a  Fourier  coefficient 
for  a  specific  function,  we  can  use  Parseval’s  Equality  to  arrive  at 

(P-jf,  f)  =  J2  2n2~j/2  f  9  (2^)  d>  (z)  e~kizdz  2 

k 

=  lf  \2~j/227rg(2jz)^(z)\2dz. 

Collecting  terms  and  simplifying  where  possible  we  get 


(HI-3) 


(P-if,  f }  =  2tt2~^  r  | g  (2 h)  d>  (2)  I2  dz. 

J  —  7T 
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If  we  now  revert  to  the  original  variables,  that  is,  let  2J z  —  co  we  get 

/2J7r  poo 

(2~Ju;)|  du)  =  2tt  /  \g  (w)|2  |$  (2-,w)  |  du. >. 

-2j7T  J  —  OO 

We  have  seen  in  III.l  that  the  right  hand  integral  above  converges  to 


Combining  all  we  have  done  we  get 

/CO 

1 9  M  \2dw  =  2n  (g,  g)  =  (/,  /) . 

•OO 

Which  shows  (Jjez  Vj  =  L2- 

So  far,  we  have  shown  the  existence  of  a  sequence  of  orthonormal  sets  of 
functions  the  limit  of  which  is  dense  in  L2.  It  might  appear  that  we  are  nearing  the 
end  of  our  search  for  basis  functions  on  which  to  base  our  wavelet  transform.  What 
we  have  is  a  good  foundation  for  an  approximation  scheme,  however  this  scheme 
falls  short  of  satisfying  all  the  conditions  we  need  for  our  basis  functions.  Namely, 
projecting  onto  Vj  for  fixed  j  doesn’t  provide  any  frequency  information.  If  we  attempt 
to  get  frequency  information  by  projecting  onto  other  subspaces  Vn  for  various  n  7^  j, 
we  no  longer  have  a  orthonormal  system,  since  4>jk  is  not  orthogonal  to  4>nm  for 
j  7^  n  as  either  Vj  C  Vn  or  Vn  C  Vj.  This  leads  us  to  introduce  the  detail  space 
Wj  associated  with  Vj,  which  we  will  call  the  scaling  space.  Formally,  Wj  is  the 
orthogonal  complement  of  Vj  in  Vj-\.  This  allows  us  to  express  Vj_\  as  the  direct 
sum  of  Vj  and  Wj.  We  can  express  this  graphically  as 

Vj  -»  Vj +i  -*  Vj+  2  -»•  Vj+ 3  ->■••• 

\  \  \  \ 

Wi+i  wj+2  Wj+  3  ... 

Consider  the  sequence  of  spaces  Wj,  since  each  is  the  orthogonal  complement  of 
its  companion  scaling  space  Vj  and  Wj+ \  is  a  subspace  of  Vj,  it  is  clear  that  Wj 
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is  orthogonal  to  Wj+\.  Additionally,  we  saw  earlier  that  =  {0} ,  this  implies 

Ujez  Wj  =  L2.  Since  the  W/s  are  all  orthogonal,  we  have  a  direct  sum  decomposition 
of  L2  namely, 


L2  =  0w3, 

jez 

This  certainly  is  helpful,  as  all  we  need  to  do  now  is  find  an  orthogonal  basis  for  each 
Wj.  Taking  the  union  of  all  these  basis  functions  will  yield  an  orthogonal  basis  for 
L2. 

This  may  seem  to  be  nearly  as  daunting  a  task  as  we  initially  faced,  but  we 
can  now  take  advantage  of  the  highly  structured  nature  of  the  Vj  spaces  and  the 
orthogonality  of  Wj  to  the  Vj.  First,  observe  that  since  Wj  c  Vj— i,  perhaps  we  can 
use  the  orthogonal  basis  for  Vj-i  to  construct  an  orthogonal  basis  for  Wr  Let  us 
proceed  along  these  lines.  Any  prospective  basis  function,  <p,  for  W0,  must  be  an 
element  of  Wo  and  since  Wo  C  V-\  we  must  have  the  following  representation  for  <p 

<p  =  J2dl<f>(  2x-l).  (III.4) 

l 

Such  functions  <p  are  the  wavelets  which  we  have  alluded  to  throughout  this  paper. 
Since  we  are  interested  in  compactly  supported  basis  functions  we  assume  {di}  to  be 
finite.  We  also  require  to  be  orthogonal  to  d>  and  all  its  integer  translates,  since 
Wq  is  the  orthogonal  complement  of  Vo.  To  aid  us  in  our  investigation  we  use  the 
following  representation  of  the  Poisson  Summation  Formula,  a  derivation  of  which  is 
found  in  the  appendix, 

J2  (<P  ( x )  ,4>(x-  0)  e~iwl  =  V  (w  +  2 kn)  d>  (w  +  2kn)  for  k,l  e  Z.  (III.5) 

i  k 

Observe  that  this  formula  establishes  the  equivalence  of  the  following  statements: 

1.  ip(x)  is  orthogonal  to  (f)  (x  —  l)  for  l  e  Z. 

2-  X)fcez  ^  (w  +  2kir)  $  (u  +  2kn)  =  0  for  all  w6R. 
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We  can  see  this  since  if  1  is  true  then  the  sum  on  the  left  hand  side  of  III.5  is 
clearly  zero  for  all  u  €  R.  Conversely,  if  2  is  true  then  the  left  hand  side  of  III.  5  is 
a  Laurent  series  in  ew  which  is  zero  at  all  values  of  u,  and  must  be  identically  zero, 
therefore  each  of  the  inner  products  in  the  sum  must  be  zero. 

Now,  let  us  apply  this  to  the  problem  of  finding  a  basis  for  W0.  Suppose  cp  (x) 
is  orthogonal  to  4>  (x  —  k)  for  k  €  Z,  this  is  equivalent  to 


*  (u  +  2tor)  *  (w  +  2for)  =  0  for  all 

k 

We  can  use  III. 4  to  get  an  expression  for  \V,  the  Fourier  Transform  of  (p 
*  (0  =  J  J2  M  (2x  -  l )  e-^dx  =  (|)  • 

As  we  have  seen  before,  the  Fourier  Transform,  4>,  of  o  can  be  expressed 


Consider  the  function 


*(0 


(III.6) 


F(0  =  J>(?  +  2fer)|2. 

k 

We  can  write  this  function  in  the  following  way, 

F(£)  =  ]T4>(£  +  2&7t)  <E>  (£  4-  2kir). 

k 

Now,  by  III. 5  F  is  equal  to  Y2i  {4>  (x)  >  <t>  (x  —  l))  =  1  because  (f>ok  form  an  orthonormal 
basis  for  Vo-  So,  F  (f)  =  1  for  all  (el 

Define  the  following  trigonometric  polynomials 

o(0  = 

I 

o(o  = 
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Substituting  into  III. 6  we  get 


o  =  ]T>(w  4-  2kir)  $  (u>  +  2kn) 


y'  D  +  2fo ^  +  2kn ^  +  2A:7r^  ( u  4-  2kir 


(in. 7) 


Observe  when  k  is  even  D  (^f^)  =  £>  (f),  and  when  k  is  odd  D  = 

0(2  +71").  Similarly,  when  k  is  even  C  (“+%klT)  =  C  ,  and  when  k  is  odd 
C  (^+2fc7r)  _  (^  +  7r).  We  can  rewrite  the  sum  as 


E 


D 


!Mf) 


$ 


4-  2  (2A:)  7r' 


2  > 

1 

fto  \ 

(LO  \ 

-  +  7T)  C 

-  +7I-) 

\  2  J 

(2  J 

$ 


a;  4-  2  (2&  4-  1)  7r 


.  (HI. 8) 


Factoring  out  the  terms  which  are  independent  of  k  we  get 

, | $  [^  +  2kn)' 

k 


*(IMi)EN!+H 


+  £)  (|  +  tt)  C  (|  +  tt)  |$  (|  +  7T  +  2A:tt)  .  (III.9) 

/c 

The  summations  above  are  simply  F  (^)  and  F  (ij  -f  tt)  so  we  can  write 
T  (a;  4-  2&7t)  $  (a;  4-  2A;7r)  = 


D  (!) c  (!) ; F  (i) + D  (I + *)  c  (f + *)f  (f + *)  ■  (“no) 

But  F  (^)  =  F  (^  +  7r)  =  1,  so  we  have 


0  =  +  2k7r)®  +  2k”)  =  V  (^)  C  +D(|+7r)c(|+7r). 

(III.  11) 

To  simplify  our  notation  we  will  use  u>  in  place  of  j.  Since  <f>ok  form  an  orthonormal  set, 
C  (to)  and  C  (u  +  tt)  cannot  simultaneously  be  zero;  this  is  a  consequence  of  the  third 
condition  of  Theorem  II. 1.  If  III. 11  is  to  be  true  then  when  C  (u>)  is  zero,  C  (to  4-  tt) 
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cannot  be  zero.  Therefore,  D  (cj  +  7r)  must  be  zero.  Similarly,  C  (to  +  n)  and  D  (cj) 
must  have  the  same  zero  structure.  Since  C  (cj)  is  a  trigonometric  polynomial,  D  (cj) 
must  have  the  following  form 

D  (cj)  =  A  (cj)  C  (cj  +  7r). 

Where  A  (cj)  is  27r-periodic,  and  A  (cj  +  7r)  =  —A  (cj)  .  Further,  since  we  desire  tp 
to  be  compactly  supported,  A(cu)  must  be  a  trigonometric  polynomial.  Now,  III. 11 
becomes 

0  =  A  (u>)  C  (oJ  +  7r)C  (u>)  +  A  (u>  +  ft)  C  (ui  +  7T  +  7r)C  (iO  +  7r) 

=  A  (cj)  C  (cj  T  7r)(7  (a;)  —  A  (a ;)  C  (a ))C  (u)  +  7r) 

since  C  (cj)  is  27r-periodic.  Thus  we  have  shown,  subject  to  the  above  conditions  on 
A  (cj),  functions  generated  by  III. 4  lie  in  the  orthogonal  complement  of  Vo  in  V-i- 
What  remains  to  be  shown  is  that  these  functions  ip  are  orthogonal  to  their  integer 
translates. 

To  show  this  we  will  again  use  the  Poisson  Summation  Formula  III. 5.  We  see 
that  orthogonality  of  (p(x)  to  <p(x  —  l)  for  l  eZ  is  equivalent  to 

i  =  £*  (cj  +  2/:7r)  (cj  +  2kn). 

k 

Substituting  in  the  expression  for  the  Fourier  Transform  of  <p  and  simplifying  in  a 
manner  similar  to  that  used  above  we  get 

1  =  ^2  ^  (u  +  2&7t)  (cj  +  2kir)  =  D(u)D  (cj)  +  D  (cj  +  7r)  D  (cj  +  7r).  (III. 12) 

k 

Since  D  (cj)  —  A  (cj)  C  (cj  +  7r),  we  have 

1  =  A  (cj)  C  (cj  +  it)  A  (cj)C  (cj  +  7r)  +  A  (cj  T  7t)  C  (c o)A  (cj  +  tt)C  (cj)  . 
Reorganizing  slightly  and  recalling  that  A  (cj  +  7r)  =  -  A  (cj)  we  find 

1  =  A  (cj)  Aju)C  (cj  +  t r)  C  (cj  +  tt)  +  A  (cj)  A(cj)C  (cj)  C(cj). 
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Since  <f>  (a;)  is  orthogonal  to  <f>(x  -  l),  we  know  C  (lo  +  it)  C  (u  +  ir)  +  C  (u)  C  Ju)  =  1. 
If  we  additionally  require  |^4(w)|  =  1,  then  III. 12  will  be  satisfied,  and  (p  (x)  will  be 
orthogonal  to  cp(x  —  l). 

Let  us  collect  the  conditions  we  have  imposed  on  A  (a;)  and  see  what  type 
function  they  characterize. 

1.  A  (u)  is  a  trigonometric  polynomial,  and  27r-periodic. 

2.  A  ( 00  +  7r)  —  — A  (u>). 

3.  |A(w)|  =  l. 


Condition  3  implies  the  graph  of  A  (a;)  traces  out  the  unit  circle  or,  in  light 
of  condition  1,  A  (a;)  =  a0e~imLO  for  |a0|  =  1  and  some  m.  Condition  2  implies  m 
must  be  odd.  We  have  a  great  deal  of  freedom  in  our  selection  of  a0,  but  since 
we  have  so  far  only  considered  real  valued  {d/},  we  will  limit  ourselves  to  ao  =  ±1 
and  for  this  development  we  will  assume  a0  —  1.  We  will  let  m  be  the  highest 
index  of  the  set  { ck } that  is,  if  {cfc}  has  four  elements  we  will  set  m  =  3,  because 
{Ck}  =  {c0,c1,c2,c3}. 

Having  made  these  choices  we  see 

m 

D  (uo)  =  eim“-  ^  cke~ik^). 

2  fc= o 

Or,  after  simplifying 

1  m 
1  k= 0 

which  we  can  express  as 


D(  w)  = 


1 

2 


m 


J2die~il“ 


1  Condition  II.  11  implies  the  highest  index  of  {c*}  must  be  odd.  Further  any  odd  m  gives  rise  to 
a  translated  version  of  this  same  (p. 
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where  dt  =  c^rt.  That  is  {di}  =  {-c^,c^Z 7,  •  •  •  ,c7,  -c7,  c^}.  Recall 

this  {dj  is  such  that  <p  (x)  =  ^  di<f>  (2x  —  l)  and  its  integer  translates  (p(x  —  k)  form 
an  orthonormal  set  in  Wq. 

It  is  not  obvious  that  this  set  spans  W0.  We  can  see  that  it  does  by  employing 
an  argument  similar  to  that  above.  Suppose  g  (x)  €  Wo,  then  g  (x)  6  V-i  and 
g  (x)  _LVo-  We  can  express  g  (x)  as 

9  0*0  =  ^2  7 \4>  (2x  —  l) . 

i 

This,  in  turn  gives  us  G  (£),  the  Fourier  Transform  of  g  (x) 

G(f)  =  Iy>-  *.(§). 

Define  the  Laurent  series  in  e~*^,  T  (£)  by 

1  l 

Since  g  (x)  J_Vo,  we  can  obtain  the  following  relationship  between  C  (£)  and  T  (£) 

o=r(5M!)+r(Hc(M- 

Now,  by  exploiting  the  zero  structure  of  C  (£)  we  can  see  T  (£)  must  have  the  following 
form 

r({)  =  b  ({)<?({ +  *), 

where  B  (£)  satisfies  the  following: 

1.  B  (£)  is  a  Laurent  series  in  e-*^. 

2.  B  (£)  is  27r-periodic. 

3.  B{t  +  ir)  =  -B(t). 

Conditions  2  and  3  above  imply  B  (£)  consists  solely  of  odd  powers  of  e~^. 
Each  of  these  odd  powers  in  turn  corresponds  to  a  translation  of  the  function  cp.  So, 
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any  function  g  ( x )  €  W0  can  be  represented  by  a  weighted  sum  of  ip  (x  —  k).  Thus  we 
have  shown  ip(x  —  k)  span  the  space  Wo. 

To  summarize,  so  far  we  have  a  sequence  of  spaces  Wj  the  direct  sum  of  which 
is  L2 .  We  have  demonstrated  a  basis  {<p(x  —  />:)}  for  Wo  which  is  constructed  from 
functions  (j)  (2x  —  l)  which  form  a  basis  for  VLj.  Now  we  need  to  show  an  orthonormal 
basis  for  Wj  is  given  by  {pjk  =  2~^2p  (2~3x  —  A;)}.  To  do  this  we  use  III. 5.  Prom 
standard  Fourier  Transform  operations  we  can  see 

=  2j/2e~Vkl^  (2Je) 

$jk  (£)  =  2j/2e~2jkiZ<$>  (2j£)  . 

It  is  easily  verified  using  arguments  similar  to  those  above  that  ipjk  is  orthogonal  to 
(j)jk.  It  is  equally  easy  to  verify  (<pjk,  <Pjn )  =  fin,k-  Thus  for  each  scaling  level  j  we 
have  an  orthonormal  set  {pjk},  which  lies  in  Wj.  All  that  remains  to  show  is  that 
each  set  spans  the  respective  space  Wj.  This  is  accomplished  in  a  manner  completely 
similar  to  that  used  for  Wo.  Finally,  we  have  an  orthonormal  basis  for  L2,  namely, 
{pjk  =  2~j/2<p  ( 2~3x  —  k)}  for  j,  k  e  Z. 

B.  ALGORITHM  DEVELOPMENT 

Armed  with  this  basis  for  L 2  we  now  set  out  to  implement  a  discrete  transfor¬ 
mation  which,  as  promised  in  the  introduction,  has  good  time-frequency  localization, 
suitable  power  of  approximation,  and  can  be  computed  quickly.  Time  localization  is 
a  result  of  the  compact  support  of  the  scaling  functions  and  wavelets,  and  the  fact 
that  the  support  of  these  functions  shrinks  as  the  index  of  the  subspaces  Vj  and 
Wj  decrease  (toward  — oo).  Frequency  localization  is  accomplished  by  dividing  the 
spectrum  of  the  input  function  into  “octaves”  each  of  which  is  represented  by  the 
projection  of  input  function  onto  some  Wj.  This  is  not  as  sharp  as  the  frequency 
localization  of  the  Fourier  Transform,  but  it  is  still  quite  good,  and  can  be  computed 
rapidly.  The  entire  derivation  of  the  basis  functions  was  driven  by  the  desire  to  take 
advantage  of  a  recursive  relation  between  basis  functions.  We  do  not  have  a  recursive 
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relation  between  the  actual  basis  functions  <pjk,  however,  we  do  have  a  linear  relation 
between  the  basis  functions  and  a  set  of  auxiliary  functions  <f>jk,  which  we  call  the 
scaling  functions 

ip(x)  =  ^2  di<j>  (2x  -  l) .  (III.  13) 

i 

This  would  not  be  of  much  use  without  the  associated  recursive  relation  between  the 
scaling  functions  at  subsequent  levels  of  scaling 

<f>(x)=^2ck(f>(2x-k).  (III.  14) 

k 

We  will  exploit  these  relations  to  achieve  a  computational  efficiency  of  order  N  opera¬ 
tions.  In  computing  a  transform  we  are  concerned  with  projections.  Since  projection 
operators  are  linear,  we  can  recast  III.  15  and  III.16  in  terms  of  the  projection  of  an 
arbitrary  function  /  (x)  on  <f>jk  and  <pjk.  Specifically, 

(/  (x) ,  0  (x))  =  $></  (x)  ,<j)(2x-  k))  (III. 15) 

k 

and 

(/  (x) ,  ip  (x))  =  '}2dl(f  (x) ,  </>  ( 2x  -  k))  .  (III. 16) 

i 

Put  another  way,  these  relationships  allow  us  to  compute  the  projection  of  /  (x)  onto 
Wj  given  the  projection  of  /  (x)  onto  Vj~\. 

Before  we  can  develop  an  algorithm,  we  need  to  recognize  that  the  theory  we 
have  developed  so  far  defines  a  transform  which  operates  on  functions  in  L2,  square 
summable  functions  defined  on  the  real  numbers.  Further,  the  transform  projects 
the  input  function  onto  a  bi-infinite  sequence  of  subspaces  of  L2.  In  practice  we  are 
given  a  sequence  of  uniformly  sampled  values  of  the  input  function  instead  of  the 
actual  function.  We  will,  for  the  purposes  of  this  development,  assume  the  input 
function  /  is  zero  at  its  boundaries.  By  doing  this  we  can  “pad”  the  function  with 
zeros  and  apply  the  transform  as  if  the  function  were  defined  over  the  entire  real 
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line.  There  are  methods  of  treating  input  functions  which  are  not  zero  at  their 
boundaries,  but  these  methods  are  beyond  the  scope  of  this  development.  Next  we 
must  address  the  question  of  where  in  the  bi-infinite  sequence  of  subspaces  to  start 
the  transform.  The  sampled  values  of  /  are  a  piecewise  constant  approximation  to 
the  actual  input  function2.  We  will  define  the  fine  scaling  unit  to  be  the  “distance” 
(difference  in  independent  variable)  between  successive  samples.  A  consequence  of  the 
orthogonality  of  the  scaling  functions  is  that  they  must  have  approximation  power  at 
least  one.  This  is  easily  seen  since  \rrio  (z)|2  -f  \m0  (z  +  tt)  |2  =  1  for  all  z  €  R,  and 
mo  (0)  =  1,  mo  (7r)  =  0.  Put  another  way,  the  polynomial  with  coefficients  {c*,}  has  a 
root  at  7 r,  which  tells  us  the  scaling  function  associated  with  {c*,}  has  approximation 
power  at  least  one.  How  do  we  use  this  fact?  Over  the  region  where  /  is  represented 
by  a  constant,  the  scaling  function  and  its  translates  can  exactly  represent  /.  Thus 
the  projection  of  /  onto  the  wavelets,  cpjk,  at  this  scaling  level  is  zero.  Also,  at  any 
finer  scaling  level  the  projection  of  /  onto  the  wavelets  must  be  zero.  In  light  of 
these  facts  we  will  start  the  algorithm  at  scaling  level  l,  where  the  scaling  function 
2 ~l/2(f)  (2lx)  has  support  width  of  one  fine  scaling  unit.  We  call  this  the  finest  scaling 
level. 

We  now  outline  the  algorithm  to  compute  the  discrete,  orthogonal  wavelet 
transform.  Figure  3  depicts  the  arrangement  of  wavelet  and  scaling  functions.  The 
first  step  in  the  algorithm  is  to  calculate  the  projection  of  the  input  function  onto  the 
scaling  functions  at  the  finest  scaling  level.  There  are  many  ways  to  accomplish  this, 
one  way  is  presented  in  the  appendix.  Once  we  have  these  projections,  the  algorithm 
is  simply  a  matter  of  using  III.  15  and  III.  16  to  reorganize  the  data  into  the  projections 
onto  Vi+i  and  Wj+1.  This  is  accomplished  by  taking  weighted  sums  of  the  projections 
onto  Vj.  To  calculate  projections  onto  V)+1,  we  use  as  weights  { ck } .  To  calculate  the 

2  Other  approximations  may  be  used,  for  instance,  if  the  input  function  is  represented  in  a  B- 
spline  basis  of  order  two,  we  would  have  a  piecewise  quadratic  approximation  to  the  input  function. 
Similar  results  for  higher  order  approximations  can  be  developed  in  a  manner  similar  to  that  used 
for  piecewise  constant  approximation. 
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projections  onto  Wi+i,  we  use  as  weights  {d\}.  We  repeat  this  process  until  we  have 
calculated  the  projections  onto  all  of  the  subspaces  Vj  and  Wj,  for  l  <  j  <m.  Here  m 
is  the  index  such  that  the  scaling  function  2 (2mx)  has  width  of  support  equal 
to  that  of  /.  The  discrete,  orthogonal  wavelet  transform  of  the  function  /  is  given  by 
the  following  collection  of  projections 


W  (/)  =  {(/>  Vjk)  \  l  <  j  <m}U{(f,  4 )mfc )}  . 

The  following  is  a  pseudocode  version  of  what  we  have  just  described. 

j-0 


j=-3  (finest  scaling  level) 


weighted  sum  of 
fine  scaling 
functions  produces 
coarse  wavelet  or 
scaling  function 


support  of  scaling  functions  at  finest  scaling  level 
support  of  f(x) 

Figure  3.  Organization  of  Basis  Function 
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input : 

1 .  n  4-  1  dilation  equation  coefficients  for  the  wavelet  we  wish  to  use 

2.  sequence  of  N  values  which  represent  the  projection  of  /  onto  <f>ik  at  the 
finest  scaling  level.  N  =  n2v~l  +  1  for  some  p. 

begin  algorithm: 
for  j  =  l  —  m  —  1  :  0 
for  k  —  0  :  n2-?_1  —  3 
(/,  <t>jk)  =  V2  YZt  Ci-2 k  (/, 

(/>  <Pjk)  =  i/2  Yflttk  C'-2fc  (/> 

end 

end 

output  :W(/)  =  {(/,¥>#)  M  <  J  <  0}  U  {(/,  (pok)} 

This  brings  up  another  aspect  of  implementing  the  wavelet  transform  on  a 
computer.  The  sequence  of  subspaces  we  project  onto  is  bi-infinite.  We  saw  earlier 
that  we  could  pick  a  starting  index  based  on  the  resolution  with  which  /  was  rep¬ 
resented.  The  algorithm  to  compute  the  transform  must  stop  at  some  index.  What 
should  this  index  be,  and  what  are  the  consequences  of  stopping  the  algorithm  here? 
In  this  development  we  will  stop  the  transform  at  the  index  where  the  scaling  function 
has  support  width  equal  to  the  width  of  the  input  function  sample.  At  each  step  of  the 
transform  we  are  calculating  the  projections  of  /  onto  succeedingly  lower  frequency 
basis  functions  as  the  support  width  of  (pjk  increases.  Terminating  the  algorithm  at 
any  given  index  establishes  a  low  frequency  bound  below  which  we  have  no  frequency 
localization. 

This  section  has  presented  a  very  basic  outline  of  the  discrete,  orthogonal 
wavelet  transform  algorithm.  There  are  many  choices  as  to  the  indices  on  which  the 
algorithm  should  begin  and  terminate.  The  major  concept  of  the  algorithm  is  the  use 
of  the  relationships  III. 15  and  III. 16  to  rapidly  compute  projections  at  one  level  from 
those  at  a  preceding  level.  It  is  precisely  these  relationships  which  allow  the  wavelet 
transform  to  be  computed  in  order  N  operations  as  we  shall  now  see. 
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To  start  our  study  of  the  computational  efficiency  of  the  algorithm  we  will 
assume  the  input  function  is  given  as  N  sampled  values.  We  further  assume  that 
N  =  2pn  +  l  for  some  p  where  n  is  the  highest  index  of  {qt}.  We  divide  the  algorithm 
into  the  following  blocks. 


1.  Calculation  of  projections  at  finest  scaling  level:  Nearly  any  quadrature  rule 
can  be  used  to  calculate  these  projections  in  O  ( N )  operations. 

2.  Calculation  of  wavelet  coefficients:  At  each  level  j  of  the  p  levels  of  the  algo¬ 
rithm  we  must  calculate  n2J  —  2  wavelet  coefficients,  each  of  these  coefficients 
is  calculated  by  taking  a  weighted  sum  of  projections  onto  the  scaling  coeffi¬ 
cients  at  the  preceding  level.  The  computational  cost  for  the  weighted  sum  is 
n  +  1  multiplies  and  n  additions,  this  makes  the  total  cost  for  this  block 

v- 1 

Y^(n2j  -2)  (2n+l). 
i= o 

3.  Calculation  of  scaling  function  coefficients  :  At  each  level  we  must  calculate 
the  projections  of  the  function  onto  the  scaling  functions.  This  is  accomplished 
by  taking  a  weighted  sum  of  projections  onto  the  scaling  functions  at  the 
preceding  level.  The  computational  cost  is  identical  to  that  for  calculating  the 
wavelet  coefficients  above. 


If  we  sum  the  computational  cost  for  each  of  these  blocks  we  get  a  total 
algorithmic  cost  of 


p- 1 

O  (N)  +  2  Y^(n2j  ~  2) (2n  +  !)• 
j=o 

The  above  sum  is  dominated  by  2  (2 n  +  1)  2P  which  is  O  (N).  So,  we  have  developed 
a  transform  which  provides  good  time-frequency  localization,  can  be  used  as  an  ap¬ 
proximation  scheme,  and  is  rapidly  computed.  Some  applications  and  directions  for 
further  studies  are  discussed  in  the  next  chapter. 
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IV.  CONCLUSIONS  AND  DIRECTIONS 
FOR  FURTHER  STUDY 


We  have  shown  that  we  can  develop  a  transform  based  on  solutions  to  the 
dilation  equation.  This  transform  is  useful  because  we  can  alter  the  characteristics 
of  this  transform  simply  by  changing  the  dilation  equation  coefficients.  Also,  we 
can  compute  this  transform  quickly.  In  fact,  the  order  N  operations  required  for 
the  wavelet  transform  is  considerably  better  than  the  N  log  (JV)  operations  required 
for  the  fast  Fourier  Transform  for  large  N.  The  wavelet  transform  has  the  added 
advantage  of  time  localization.  A  major  advantage  of  the  Fourier  Transform  is  that 
the  basis  functions  are  eigenfunctions  for  an  operator  of  fundamental  importance, 
namely  the  Laplacian  operator.  The  wavelet  transform,  on  the  other  hand,  has  been 
restricted  predominantly  to  use  in  the  signal  processing  field.  In  this  field  the  wavelets 
time-frequency  localization  and  rapid  computation  are  great  assets,  and  projection 
onto  a  basis  of  eigenfunctions  is  not  always  as  desirable  as  these  other  properties. 

To  be  as  useful  as  the  Fourier  Transform,  wavelets  must  be  “well  behaved” 
under  some  operator  of  interest.  By  well  behaved  we  mean  the  image  of  the  wavelet 
under  the  operator  is  simply  related  to  the  wavelet  or  its  translations  and  dilations. 
Eigenfunctions  for  an  operator  are  the  ultimate  in  well  behaved  functions.  It  is 
clear  from  our  preliminary  work  in  Chapter  II  that  convolution  and  convolution  type 
operations  are  the  natural  operations  for  wavelets.  If  we  are  to  find  an  operator 
under  which  wavelets  would  be  well  behaved,  it  is  natural  to  first  consider  operators 
based  on  convolution  like  processes.  A  broad  field  of  such  operators  is  the  Zygmund- 
Calderon  group  of  operators.  A  description  of  these  operators  is  beyond  the  scope  of 
this  work,  but  this  is  a  burgeoning  area  for  wavelet  analysis. 

An  area  which  has  great  potential  is  that  of  tailoring  a  wavelet  to  be  well 
behaved  under  a  particular  operator.  To  see  how  we  might  tailor  a  wavelet  we  should 
consider  that  if  we  wish  our  wavelet  to  have  support  of  width  n  at  the  zeroth  scaling 
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level,  {cfc}  must  have  n  + 1  elements,  namely  Co, . . .  ,  cn.  We  still  desire  the  wavelets  to 
be  orthogonal,  so  we  must  meet  the  necessary  conditions  of  section  1.3.  Specifically, 
ckck-2rn  =  2 5m  foimeZ.  Each  of  these  conditions  can  be  represented  as  a  simple 
bi-linear  form 


C  b2n]C  25m, 

where  *  indicates  conjugate  transposition  and  D2m  is  the  2m  down  shift  matrix.  That 
is,  D2m  has  ones  on  its  2 mth  diagonal  and  zeros  elsewhere.  If  these  conditions  were 
linear  then  we  could  easily  determine  the  solution  space  for  this  system.  However, 
these  conditions  are  not  linear  and  systems  of  bi-linear  forms  are  not  well  studied. 
We  can  see  for  m  =  0  the  condition  is  |c^.|2  =  2.  We  can  also  see  for  each  positive 
value  of  m  the  condition  is  equivalent  to  the  corresponding  condition  for  —  m,  since 
c*D2mc  =  0  =  0*  =  c*D;m(c*)*  =  c*D  _2mc.  So,  we  have  reduced  the  number  of 
conditions  from  n  to  To  make  further  progress  we  need  to  be  able  to  determine 
the  solution  space  of  a  system  of  bi-linear  forms.  The  use  of  the  term  space  here  refers 
to  the  set  of  all  solutions,  not  necessarily  a  linear  space.  The  solutions  of  a  system  of 
bi-linear  forms  need  not  be  linear.  Once  we  have  the  solutions  to  this  system  we  would 
then  determine  which  of  these  solutions  could  be  written  as  linear  combinations  of  the 
vectors  { [w*] }  as  defined  in  section  1.2  where  p  is  our  desired  approximation  power. 
This  would  satisfy  the  sufficient  condition  for  approximation  power.  Now  we  check 
these  solutions  against  the  sufficient  condition  for  orthogonality  presented  in  section 
1.3.  The  result  would  be  a  collection  of  dilation  equation  coefficients  which  would 
yield  orthogonal  wavelets  with  the  desired  approximation  power.  Perhaps  among  this 
collection  we  would  find  dilation  equation  coefficients  which  would  be  well  behaved 
under  the  operator  of  interest. 
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APPENDIX  A.  DERIVATION  OF  POISSON 
SUMMATION  FORMULA 

We  start  with  the  Poisson  Summation  Formula  as  found  in  any  text  on  Fourier 
Analysis 

£/(")=£F(  2fer), 

where  F  (£)  is  the  Fourier  Transform  of  /  (x).  Consider  the  function 

/  (w,  l)  =  (ip  (x)  l ))  e~™1. 


We  see  the  Fourier  Transform,  in  the  variable  /,  of  /  is  given  by 


/OO 

(ip  (x)  ,<p(x-  /))  e~wle 

-OO 


Expanding  the  inner  product  and  simplifying  the  exponential  we  get 


/oo  poo  _ 

/  ip(x)<p(x-  l)da 

oo  J  —  oo 


Now,  change  the  order  of  integration,  and  we  see 


/OO  poo  _ 

ip(x)  <j>(x-  l)  e~il^+®dl 
-oo  J  —oo 


Make  the  change  of  variables  y  =  x  —  l,  and  we  now  have 


/oo  poo  _ _ 

ip  (re)  e~tx(w+®  /  (p  (y)  ely^w+^dy  dx. 

-oo  J  — OO 


We  can  write  the  integral  in  y  above  as 


This  leaves  us  with 


/OO  _  _ 

<j>  ( y )  eiy^+^dy  =  $  (w  +  £)• 

-OO 

/OO  _ 

ip  (x)  e~ix^+^  (a ;  +  Z)dx. 

-OO 
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Since  $  (u>  +  £)  has  no  x  dependence,  we  can  write  this  as 


/OO 

ip  (ar)  e-<l(w+?)«  (w  +  O- 

■OO 

The  integral  in  x  above  is  simply  the  4/  (a;  +  f )  which  leads  us  to 


Now,  applying  the  Poisson  Summation  Formula  we  get 

y:  (ip  (x)  ,<f>(x-  l ))  e~lwl  =  4>  (u  +  2kTi)  $  (lo  +  2kn), 

lei  ke  z 

which  is  the  desired  representation  of  the  Poisson  Summation  Formula. 
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APPENDIX  B.  PROJECTIONS  AT  FINEST 

SCALING  LEVEL 


To  see  how  we  might  accomplish  this,  consider  the  following  diagram. 


requires  special  treatment 
aligned  with  constant  f(x) 


i 


support  width  of  scaling  functions  at  finest  scaling  level 


Figure  4.  Scaling  Functions  at  Finest  Scaling  Level 


Each  projection  at  the  finest  scaling  level  is  the  projection  of  0**.  onto  either 
a  constant  or  piecewise  constant  function.  Where  /  is  constant  over  the  support  of 
<f>ik  the  projection  is  simply  a  weighted  integral  of  0ifc.  Where  /  is  piecewise  constant 
over  the  support  of  <pik  we  can  divide  the  domain  of  integration  so  that  the  projection 
is  given  by  a  sum  of  integrals  over  domains  where  /  is  constant.  Each  of  these  sub¬ 
integrals  is  given  by  a  weighted  integral  of  0**,  over  a  portion  of  its  support.  For 
instance,  if  we  are  computing  the  projection  of  /  onto  the  second  scaling  function  in 
Figure  4,  we  would  have  the  following 


/  /  (a;)  02  (x)  dx  =  a  02  (x)  dx  +  a  /  02  (x)  dx  +  b  02  (re)  dx. 
Jo  Jo  J 1  J  2 


Similarly,  the  projection  onto  the  third  scaling  function  would  be 


[  f  (z)  03  (£)  dx  =  a  f  03  (x)dx  +  b  (  03  (x)  dx  +  b  f  03  (x)  dx. 
Jo  Jo  J 1  J  2 
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The  sub-integrals,  fQ  ^>3  (x)  dx,  can  be  computed  once  and  stored  for  use  since  they 
depend  only  on  the  wavelet  used.  We  will  assume  the  projection  of  the  input  function 
onto  Wi  is  zero.  This  prevents  us  from  incorporating  into  the  transform  the  disconti¬ 
nuities  introduced  by  the  piecewise  constant  approximation  of  /.  It  is  worth  noting 
that  at  this  point  in  the  algorithm  we  have  all  the  information  we  ever  will  about  the 
input  function.  In  fact,  we  do  not  use  the  function  for  the  rest  of  the  algorithm.  The 
remainder  of  the  algorithm  is  devoted  to  rearranging  the  projections  of  /  onto  the 
scaling  functions  at  the  finest  level  into  projections  of  /  onto  the  spaces  Wj  for  j  >  l. 
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